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STOCHASTIC NONLINEAR GALAXY BIASING 

Avishai Dekel ^ and Ofer Lahav ^'^ 
ABSTRACT 

We propose a general formalism for galaxy biasing and apply it to methods 
for measuring cosmological parameters, such as regression of light versus mass, 
the analysis of redshift distortions, measures involving skewness and the cosmic 
virial theorem. The common linear and deterministic relation g = b6 between the 
density fluctuation fields of galaxies g and mass 6 is replaced by the conditional 
distribution P{g\S) of these random fields, smoothed at a given scale at a given 
time. The nonlinearity is characterized by the conditional mean {g\S) = b{6)6, 
while the local scatter is represented by the conditional variance C3"^((5) and higher 
moments. The scatter arises from hidden factors affecting galaxy formation and 
from shot noise unless it has been properly removed. 

For applications involving second-order local moments, the biasing is defined 
by three natural parameters: the slope b of the regression of g on6, a nonlinearity 
b, and a scatter (Jb- The ratio of variances b^^^. and the correlation coefficient r 
mix these parameters. The nonlinearity and the scatter lead to underestimates of 
order iP /IP and cr^/iP in the different estimators of /? (~ ^l^'^ /b). The nonlinear 
effects are typically smaller. 

Local stochasticity affects the redshift-distortion analysis only by limiting the 
useful range of scales, especially for power spectra. In this range, for linear 
stochastic biasing, the analysis reduces to Kaiser's formula for b (not 6var), inde- 
pendent of the scatter. The distortion analysis is affected by nonlinear properties 
of biasing but in a weak way. 

Estimates of the nontrivial features of the biasing scheme are made based 
on simulations and toy models, and strategies for measuring them are discussed. 
They may partly explain the range of estimates for [3. 
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Subject headings: cosmology: theory — cosmology: observation — dark matter 

— galaxies: distances and icdshifts — galaxies: formation — galaxies: clustering 

— large-scale structure of universe 

1. INTRODUCTION 

Galaxy "biasing" clearly exists. The fact that galaxies of different types cluster differ- 
ently (e.g., Dressier 1980; Lahav, Nemiroff & Piran 1990, Santiago & Strauss 1992; Loveday 
et al. 1995; Hermit et al. 1996; Guzzo et al. 1997) implies that not all of them are exact 
tracers of the underlying mass distribution. It is obvious from the emptiness of large voids 
(e.g., Kirshner et al. 1987) and the spikiness of the galaxy distribution with ~ 100/i~^Mpc 
spacing (e.g., Broadhurst et al. 1992), especially at high redshifts (Steidel et al. 1996; 1998), 
that if the structure has evolved by standard gravitational instability theory (GI) then the 
galaxy distribution must be biased. 

Arguments for different kinds of biasing schemes have been put forward and physical 
mechanisms for biasing have been proposed (e.g., Kaiser 1984; Davis et al. 1985; Bardeen et 
al. 1996; Dekel & Silk 1986; Dekel & Rees 1987; Braun, Dckcl & Shapiro 1988; Babul & White 
1991; Lahav & Saslaw 1992). Cosmological simulations of galaxy formation clearly indicate 
galaxy biasing, even at the level of galactic halos (e.g., Cen & Ostriker 1992; Kauffmann, 
Nusser & Steinmetz 1997; Blanton et al. 1998; Somerville et al. 1998). The biasing becomes 
stronger at higher redshifts (e.g., Bagla 1998a, 1998b; Jing & Suto 1998; Wechsler et al. 
1998). 

The biasing scheme is interesting by itself as a constraint on the process of galaxy 
formation, but it is of even greater importance in many attempts to estimate the cosmological 
density parameter Vl. If one assumes a linear and deterministic biasing relation of the 
sort g = bS between the density fluctuations of galaxies and mass, and applies the linear 
approximation for gravitational instability, V-w = —f{Q)S with f{Q) ~ (e.g. Peebles 
1980), then the observables g and V-f are related via the degenerate combination j3 = f{Q)/b. 
Thus, one cannot pretend to have determined fl by measuring /? without a detailed knowledge 
of the relevant biasing scheme. 

It turns out that different methods lead to different estimates of f3, sometimes from 
the same data themselves (for reviews see Dekel 1994, Table 1; Strauss & Willick 1995, 
Table 3; Dekel, Burstein & White 1997; Dekel 1998a). Most recent estimates for optical 
and IRAS galaxies he in the range 0.4 < < 1. The methods include, for example: (a) 
comparisons of local moments of g (from redshift surveys) and S (from peculiar velocities) or 
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the corresponding power spectra or correlation functions; (b) linear regressions of the fields 
g and 6 or the corresponding velocity fields; (c) analyses of redshift distortions in redshift 
surveys; and (d) comparisons of the Cosmic Microwave Background (CMB) dipole with the 
Local-Group velocity as predicted from the galaxy distribution. 

In order to sharpen our determination of Q it is important that we understand the 
sources for this scatter in the estimates of (3. Some of this scatter is due to the different 
types of galaxies involved, and some may be due to unaccounted-for effects of nonlinear 
gravity and perhaps other sources of systematic errors in the data or the methods. In this 
paper we investigate the possible contribution to this scatter by nontrivial properties of the 
biasing scheme — the deviations from linear biasing and the stochastic nature of the biasing 
scheme. This is done using a simple and natural formalism for general galaxy biasing. 

The biasing of density peaks in a Gaussian random field is well formulated (e.g., Kaiser 
1984; Bardeen et al. 1986) and it provides a very crude theoretical framework for the origin 
of galaxy density biasing. In this scheme, the galaxy-galaxy and mass-mass correlation 
functions are related in the linear regime via 

egg(r) = b'Umir), (1) 

where the biasing parameter 6 is a constant independent of scale r. However, a much more 
specific linear biasing model is often assumed in common applications, in which the local 
density fluctuation fields of galaxies and mass are assumed to be deterministically related 
via the relation 

g{x) = b6{x). (2) 
Note that equation (|T]) follows from equation (^, but the reverse is not true. 

The deterministic linear biasing model is not a viable model. It is based on no theoretical 
motivation. If 6 > 1, it must break down in deep voids because values of g below —1 are 
forbidden by definition. Even in the simple case of no evolution in comoving galaxy number 
density, the linear biasing relation is not preserved during the course of fluctuation growth. 
Non-linear biasing, where b varies with 6, is inevitable. 

Indeed, the theoretical analysis of the biasing of collapsed halos versus the underlying 
mass (Mo & White 1996), using the extended Press-Schechter approximation (Bond et al. 
1991), predicts that the biasing is nonlinear and provides a useful approximation for its 
behavior as a function of scale, time and mass threshold. A^-body simulations provide a 
more accurate description of the nonlinearity of halo biasing (see Figure |I|; Somerville et 
al. 1998), and show that the model of Mo & White is a good approximation. We provide 
more details about theoretical, numerical and observational constraints on the exact shape 
of nonlinear biasing in § |^, where we estimate the magnitude of nonlinear biasing effects. 
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It is important to realize that once the biasing is nonhnear at one smoothing scale, 
the smoothing operation acting on the density fields guarantees that the biasing at any 
other smoothing scale obeys a different functional form of b{S) and is also non-deterministic. 
Thus, any deviation from the simplified linear biasing model must also involve both scale- 
dependence and scatter. 

The focus of this paper is therefore on the consequences of the stochastic properties of 
the biasing process, which could either be related to the nonlinearity as mentioned above, or 
arise from other sources of scatter. An obvious part of this stochasticity can be attributed 
to the discrete sampling of the density field by galaxies — the shot noise. In addition, 
a statistical, physical scatter in the efficiency of galaxy formation as a function of 6 is 
inevitable in any realistic scenario. It is hard to believe that the sole property affecting 
the efficiency of galaxy formation is the underlying mass density at a certain smoothing 
scale (larger than the scale of galaxies). For example, the random variations in the density 
on smaller scales is likely to be refiected in the efficiency of galaxy formation. As another 
example, the local geometry of the background structure, via the deformation tensor, must 
play a role too. In this case, the three eigenvalues of the deformation tensor are relevant 
parameters. Such hidden variables would show up as physical scatter in the density- density 
relation. A similar scatter is noticeable in the distribution of particular morphological types 
versus the underlying total galaxy distribution (Lahav & Saslaw 1992). The hidden scatter 
is clearly seen for halos in simulations including gravity alone (§ ^ below and Figure |T] based 
on Somerville et al. (1998) even before the more complex processes involving gas dynamics, 
star formation and feedback affect the biasing and in particular its scatter. 

In practice, there are two alternative options for dealing with the shot noise component 
of the scatter. In some cases one can estimate the shot noise and try to remove it prior to 
the analysis of measuring (3. This is sometimes difficult, e.g., because of the finite extent of 
the galaxies which introduces anti-correlations on small scales. The shot noise is especially 
large and hard to estimate in the case where the biasing refers to light density rather than 
number density. The alternative is to treat the shot noise as an intrinsic part of the local 
stochasticity of the biasing relation without trying to separate it from the physical scatter. 
The formalism developed below is valid in either case. 

In § ^ we present the biasing formalism, separate the effects of nonlinear biasing and 
stochastic biasing, and apply the formalism to measurements involving local second-order 
moments of 5 and g. In § ^ we derive relations for two-point correlation functions in the 
presence of local biasing scatter. In § ^ we apply the formalism to the analysis of redshift 
distortions. In § | we address methods involving third-order moments. In § ^ we discuss 
constraints on the nonlinearity and scatter in the biasing scheme based on simulations, simple 
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models and observations. In § |^ we summarize our conclusions and discuss our results and 
future prospects. 

2. LOCAL MOMENTS 

2.1. Conditional Distribution 

Let S{x) (= Sp/p) be the field of mass-density fluctuations and g{x) (= Sn/n) the 
corresponding field of galaxy-density fluctuations (or, alternatively, the field of light-density 
fluctuations, whose time evolution is less sensitive to galaxy mergers). The fields are both 
smoothed with a fixed smoothing window, which defines the term "local". The concept 
of galaxy biasing is meaningful only for smoothing scales larger than the comoving scale 
of individual galaxies, namely a few Mpc. An example would thus be a top-hat window 
of radius 8 h^^Mpc, for which the rms fluctuations of optical galaxies is about unity. Our 
analysis is confined to a specific smoothing scale, at a specific time, and for a specific type 
of objects. 

Assume that both 6 and g are random fields, with one-point probability distribution 
functions (PDF) P{S) and P{g), both of zero mean by definition and of standard deviations 
(7^ = (5^) and = (g"^)- The key idea is to consider the local biasing relation between galax- 
ies and mass to also be a random process, specified by the biasing conditional distribution 
P{g\S) of (7 at a given 6. 

We shall use below in several different ways the following lemma relating joint averaging 
and conditional averaging for any functions p{g) and q{S): 

{p{9)qiS)) = {{p{9m,\sq{S))s, (3) 

where the inner average is over the conditional distribution of (7 at a given 6 and the outer 
average is over the distribution of 6. This is true because 

{p{9)q{S)) =J J dgd6P{g,6)p{g)q{6) = J d6P{6)q{6) J dg P{g\6) p{g), (4) 

in which the first equality is by definition, and where for the second equality P{g, S) has 
been replaced by P{g\6)P{6) and the double integration has become successive. 
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2.1.1. Conditional Mean - Nonlinearity 

Define the mean biasing function b{6) by the conditional mean, 

b{6)6^{g\6) = JdgP{g\6)g. (5) 

This function is plotted in Figure |l[ This is a natural generalization of the deterministic 
linear biasing relation, g = bi6. The function b{6) allows for any possible non-linear biasing 
and fully characterizes it; it reduces to the special case of linear biasing when b{6) = 6i is a 
constant independent of S. 

In the following treatment of second-order local moments, we will find it natural to 
characterize the function b{S) by the moments b and b defined by 

b= {b{6)6^)/a^ and b^ = {b\6)6^)/a\ (6) 

In the case of linear biasing they both coincide with bi. It will be shown that the parameter b 
is the natural generalization of bi and that the ratio 6/6 is the relevant measure of nonlinearity 
in the biasing relation; it is unity for linear biasing, and is either larger or smaller than 
unity for nonlinear biasing. As can be seen from the definitions of b and 6, this measure 
is independent of the stochasticity of the biasing. It thus allows one to maintain general 
nonlinearity while addressing stochasticity. 



2.1.2. Conditional Variance - Stochasticity 

The local statistical character of the biasing relation can be expressed by the conditional 
moments of higher order about the mean at a given 5. Define the random biasing field e by 

e^g-{g\6), (7) 

such that its local conditional mean vanishes by definition, {e\6) = 0. The local variance of 
e at a given 6 defines the biasing scatter function a^^S) by 

aliS) ^ {e'\6)/a'. (8) 

The scaling by is for convenience. The function {e'^\5)^^'^ is marked by error bars in 
Figure |I]. By averaging over S, and using equation (^), one obtains the constant of local 
biasing scatter, 

al ^ {e')/a\ (9) 
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Thus, to second order, the nonlinear and stochastic biasing relation is characterized 
locally by three basic parameters: b, b, and a^,. The parameters b and b/b refer to the 
mean biasing and its nonlinearity, while o"b/& measures the scatter. This parameterization 
separates in a natural way the properties of nonlinearity and stochasticity. The formalism 
simply reduces to the case of linear biasing when b = b and to deterministic biasing when 

(Tb = 0. 

If the biasing conditional distribution, P{e\6), is a Gaussian [still allowing b{6) and 
cr^{S) to vary with S], then the first- and second-order moments fully characterize the biasing 
relation. In much of the following we will restrict ourselves to second moments, but we 
shall see in § ^ that a generalization to higher biasing moments, such as the skewness, is 
st r aight f or war d . 



2.2. Variances and Linear Regression 

From the basic parameters defined above one can derive other useful biasing parameters. 
A common one is the ratio of variances , sometimes referred to as "the" biasing parameter, 

bL ^^,=b^ + al (10) 

The second equality is an interesting result of equation . It immediately shows that 6var is 
sensitive both to nonlinearity and to stochasticity, and that always 6var > b. Note the roles 
of b^ and 0"^ as the respective contributions of biasing nonlinearity and biasing scatter to the 
total scatter in g. This makes 6var biased compared to b, 

(11) 

by the root of the sum in quadrature of the nonlinearity factor b/b and the scatter factor 
(Th/b. 

Using equation @, the mean parameter b is simply related to the covariance , 

ba' = {g6). (12) 

Thus, b is the slope of the linear regression oi g on 6 and it serves as the basic biasing pa- 
rameter — the natural generalization of the linear biasing parameter bi . Unlike the variance 
{g"^) in equation (|TDp, the covariance in equation (^) has no additional contribution from 
the biasing scatter ab- 
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A complementary parameter to 6var is the linear correlation coefficient , 



^^Wt(t^SV'\ (13) 

fXgfJ 6var feV 

The equahties are based on equation (0) and equation (|1T]) respectively. 
The "inverse" regression, oi 6 on g, yields another biasing parameter: 

The parameter 6inv is closer to what is measured in reality by two-dimensional linear regres- 
sion (e.g., Sigad et al. 1998), because the errors in S are typically larger than in g. Note that 
feinv is biased compared to b, with the ratio given by the sum in quadrature of the nonlinearity 
factor b/b and the scatter factor a^/b. 

It is worthwhile to summarize the relations between the parameters in the two degenerate 
cases. In the case of linear and stochastic biasing, the above parameters reduce to 

(2 \ t / 2 \ 

1 + , r = ^, 6i,, = 6i(l + ^j. (15) 

Thus, bi < 6var < ^inv 

In the case of nonlinear and deterministic biasing they reduce instead to 

^ ^ ~ b P 

bj^b, cTb = 0, 6var = b, r = J, b-,^^ = ^. (16) 

b 

Both 6inv and &var are biased with respect to b, and the bias in binv is always larger. Whether 
they are biased high or low compared to b depends on whether the nonlinearity factor b/b 
is larger or smaller than unity, respectively. We shall see in § ^ that although b and b could 
significantly differ from unity and from b{6 = 0), the ratio b/b, in realistic circumstances, 
typically obtains values in the range 1.0 < b/b < 1.1. This means that the effects of 
nonlinearity are likely to be relatively small. 

In the fully degenerate case of linear and deterministic biasing, all the b parameters 
are the same, and only then r = 1. Note, again, that the parameters b/b and a^/b nicely 
separate the properties of nonlinearity and stochasticity, while the parameters fevar? i^ and 
6inv mix these properties. 



In actual applications, the above local biasing parameters are involved when the pa- 
rameter is measured from observational data in several different ways. For linear and 
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deterministic biasing this parameter is defined unambiguously as Pi = f{Q)/bi. But any 
deviation from tliis degenerate model causes us to actually measure different (3 parameters 
by the different methods. 

For example, it is the parameter /5var = /(^)/&var which is determined from measure- 
ments of (jg and af{fl). The former is typically determined from a redshift survey, and 
the latter either from an analysis of peculiar velocity data, or from the abundance of rich 
clusters (with a slightly modified Q dependence), or by COBE normalization of a specific 
power-spectrum shape for mass density fluctuations. As noted in equation (^), in the 
case of stochastic biasing b^ar is always an overestimate of b. When the biasing is linear, 
equation (|r^), 6var is an overestimate of bi. The corresponding /?var is thus underestimated 
accordingly. 

A useful way of estimating j3 (e.g., Dekel et al. 1993; Hudson et al. 1995; Sigad et al. 
1998) is via the linear regression of the fields in our cosmological neighborhood, e.g., —'V'v{x) 
on g{x) (or, alternatively, via a regression of the corresponding velocities). In the mildly- 
nonlinear regime, — V-v(x) is actually replaced by another function of the peculiar velocity 
field and its first spatial derivatives, which better approximates the scaled mass-density field 
f{Vt)5{x) (e.g., Nusser et al. 1991). The regression that is done, taking the errors on both 
sides into account, is effectively 5 on g, because the errors in V-v (or f5) are typically more 
than twice as large as the errors in g (e.g., Sigad et al. 1998). Hence, the parameter that 
is being measured is close to /3inv = /(^)/&inv In the case of linear gravitational instability 
and linear deterministic biasing, the slope of this regression line is simply (3i. Note in 
equation (0) that in the case of linear and stochastic biasing 6inv is an overestimate of bi. 
The corresponding /3 is thus underestimated accordingly in this inverse-regression analysis. 



3. TWO-POINT CORRELATIONS 

For the purpose of redshift-distortion analysis we need to generalize our treatment of 
stochastic and nonlinear biasing to deal with spatial correlations. Given the random biasing 
field e, equation (|^, we define the two-point biasing-matter cross-correlation function and 
the biasing auto- correlation function by 

e.m(r) = (ei52), U{r) = {e^e^), (17) 

where the averaging is over the ensembles at points 1 and 2 separated by r. (Recall that 
throughout this paper the fields are assumed to be smoothed with a given window.) By the 
definition of the random biasing field e, and the local scatter, equation (^, at zero lag one 
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has ^em(O) = and ^«(0) = cr^a'^. We now define the biasing as local if 

^era{f^) = for any r, and ^et{f^) = for r > rb, (18) 

where rb is typically on the order of the original smoothing scale. (Some implications of 
local biasing are discussed in Scherrer & Weinberg 1998.) 

A two-point equivalent lemma to equation (^ implies that 

{giS2) = {{9lS2\SlS2)g\5)5 and {9192) = {{9192 \Sl52)g\5)5- (19) 

Using these identities, one obtains analogous relations to equations (|I2[) and ([T0|): 

egm(r) = {9162) = {b{6i)6i 62) + U{r), (20) 
egg(r) = {9192) = m)5, b{52)52) + Ur). (21) 
In the case of linear and local biasing, these become 

egm(r) = hUm{r), (22) 

^gs{r) = blUUr)+^ee{r), with ^^(r) = for r > rt. (23) 

Note that the biasing parameter that appears here is hi, not to be confused with fe^ar when 
the biasing is stochastic. 

To see how the power spectra are affected by the biasing scatter, we assume, without 
limiting the generality of the analysis, that the local biasing can be approximated by a step 
function: 

Ur) = if'" (24) 

[0 r > rb 

Recalling that the power spectra are the Fourier transforms of the corresponding correlation 
functions, 

P(k) = 471 r arf-^^^rHr, (25) 
JO kr 

we get for k <^ r^^^, from ( ^2]) and (|2^), 

Pgm(A;) = hPra^ik), (26) 

Pgg(A;) = blP^Uk) + cjyV^, (27) 

where 14. is the volume associated with the original smoothing length, rb. We see that the 
local biasing scatter adds to Pgg{k) an additive constant at all k values. 
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Finally, one can address the effect of the scatter on moments of the fields smoothed at 
a general smoothing length r > rt- Each of these moments is related to the corresponding 
power spectrum via an integral of the form 

{66)r = J d^kW\kr)P{k), (28) 

where W{kr) is the Fourier Transform of the smoothing window of radius r. Using equa- 
tion (^) one obtains for linear biasing 

al{r) = bla\r) + ala\ry,/r)'. (29) 

In the following section we will apply this formalism to the linear analysis of redshift distor- 
tions. 

For the purpose of an analysis involving nonlinear biasing, note that two-point averages 
as in equations (pOD and (pT]) are calculable once one knows the function b{6) and the one- 
and two-point distributions of the underlying density field 6. It turns out that the relation 
for ^gm, equation (PUj), is in fact a simple extension of equation (|T^) involving only the local 
moment b: 

egm(r) = kmm{r). (30) 
To prove this, we use P{6i,62) = P{62\6i)P{6i) to write 

{K5i)S,S2) = I d5^PiS^)bi5^)5^ J d52P{52\5,)52, (31) 

and use the fact that (52|5i) = 5i'Cmm('^)/c"^- To compute b one needs to know only the 
function b{5) and the one-point distribution P{5). Higher-order moments, like the one in 
equation (|2l|) , would, in general, involve the two-point PDF as well. 



4. REDSHIFT DISTORTIONS 

A very promising way of estimating (3 is via redshift distortions in a redshift survey 
(e.g.. Kaiser 1987, Hamilton 1992, 1993, 1995, 1997; Fisher et al. 1994; Fisher, Scharf & 
Lahav 1994; Heavens & Taylor 1995; Cole, Fisher & Weinberg 1995; Fisher & Nusser 1996; 
Lahav 1996). Peculiar- velocity gradients along the line of sight distort the comoving volume 
elements in redshift space compared to the corresponding volumes in real space. As a result, 
a large-scale isotropic distribution of galaxies in real space is observed as an anisotropic 
distribution in redshift space. The relation between peculiar velocities and mass density 
depends on fi, and hence the distortions relative to the galaxy density depend both on Vt 
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and on the galaxy biasing relation. In the deterministic and linear biasing case, this relation 
involves a single (3 parameter (see Hamilton 1997). However, in the general biasing case, the 
distortion analysis is in principle complicated by the fact that the galaxies play two different 
roles; they serve both as luminous tracers of the mass distribution as well as test bodies for 
the peculiar-velocity field. 

To first order, the local galaxy density fluctuations in redshift space ((^s) and real space 
{g) are related Qs = 9 — du/dr, where u is the radial component of the galaxy peculiar 
velocity v{x). Assuming no velocity biasing, linear GI theory predicts du/dr = —fi'^f{fl)6, 
where /x^ is a geometrical factor depending on the angle between v and x. Thus, the basic 
linear relation for redshift distortions is 

9s = 9 + fl^'S. (32) 

A general local expression for redshift distortions is obtained by taking the mean square: 
{9s9s) = (99) + 2(//i') {9S) + (//i')' (M). (33) 
With our formalism for stochastic biasing, using equation ( pUj ) and equation ([T^), it becomes 

al = alii + 2(/^2)rC + (34) 

This is similar to equation (7) of Pen (1998), in the sense that it involves both fe^ar and r 
in a non-trivial way and is thus directly affected by the stochasticity of the biasing scheme. 
However, we shall see that this is true only for the local moments, at the original smoothing 
length for which 6var and r were defined. When the biasing is stochastic, the situation at 
non-zero lag is very different. 



4.1. General Linear Redshift Distortions at Non-zero Lag 

The general linear analysis of redshift distortions involves non-local analysis. The gen- 
eral expression in terms of correlation functions is obtained straightforwardly from equa- 
tion ( p2D by averaging {9I92) over the distributions of 5 at a pair of points separated by 
r: 

C^ir) = egg(r) + 2(//i2)eg^(r) + {f^^^fUm{r). (35) 
(Recall that our correlation functions and power spectra correspond to the smoothed fields.) 

Recalling that the power spectra are the Fourier transforms of the corresponding corre- 
lation functions, equation (|25|) , one can equivalently write 

PL^k) = P,,{k) + 2(//x2) Pg^(A;) + (/^2)2 p^^^^j^y ^3g) 
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[This expression can alternatively be obtained from the fact that equations (^) and ( pSD are 
valid for any smoothing scale r.] 

Similarly, the spherical harmonic analysis for redshift distortions in linear theory for a 
flux-limited survey (Fisher, Scharf & Lahav 1994, equation 11) can be extended to yield for 
the mean-square harmonics in redshift space: 

= IJ dke[\^i{k)\'p^{k) + 2mi{kmk)\p,mik) + f\^m\' Pmmm, m 

where is the real-space window function and "^fik) is the redshift-correction window 

function, both depending on the selection function and the weighting function. Again, this 
expression mixes the three different power spectra such that the Q dependence, in general, 
may involve more than one unique (3. 

The crucial question is how to relate the correlation functions, or power spectra, to 
the biasing scheme. In the case of linear and deterministic biasing, one simply has Pgg = 
^i-Pgm = &i-Pmm, SO the distortiou relation reduces to Kaiser's formula, 

P4 = Pgg(l+/i2/3l)^ (38) 

where Pi = f{fl)/bi (and here bi = h^aj)- In order to obtain more specific distortion relations 
for the case of stochastic biasing, we shall use the spatial correlations from § |5 in the general 
distortion relations of the current section. 



4.2. Distortions for Linear, Stochastic and Local Biasing 

At zero lag, by definition, ^ee(O) = cr^o"^. Then, as in equation (10) for the local moments 



Cgg(O) = &var^mm(0), and the general distortion relation for ^, equation (|35|) , reduces to an 
equation similar to the local equation (0), in which the second and third terms involve 
different combinations of r and 6var and thus allow to determine them separately. However, 
at large separations r > rb, where vanishes by the assumption of local biasing, one obtains 
instead, from equations (|2^ ) and (p^) 

^ggW = egg(r)[i + 2(//.^)6r^ + (//.2)2 6r']. (39) 

This is the degenerate Kaiser formula, which is very different from the expression for local 
moments, equation (^41). In particular, it is independent of the biasing scatter! It now 
involves only the mean biasing parameter hi (a degenerate combination of r and 6var), but 
it contains no information on the stochasticity, db. In terms of 61, the relation for ^ is 
identical to the case of deterministic biasing. Thus, the distortion analysis at r > rb is 
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indeed incapable of evaluating the stocliasticity of the process. This is a straightforward 
result of the assumed locahty of the biasing scheme. The biasing scatter at two distant 
points is uncorrelated and therefore its contribution to ^gg cancels out. 

On the other hand, the redshift distortion analysis is sensitive to the non-linear prop- 
erties of the biasing relation. A proper analysis would require a nonlinear treatment of the 
redshift distortions including a nonlinear generalization of the GI relation V-t> = — /5, be- 
cause the nonlinear effects of biasing and gravity enter at the same order. The result is more 
complicated than equation (|35D, but is calculable in principle once one knows the function 
b{6) and the one- and two-point probability distribution functions of S. The insensitivity to 
stochasticity remains valid in the case of nonlinear biasing. 

Back to the case of linear stochastic biasing. The distortion relation for P{k) becomes 
more complicated because of the additive term in equation (^71). For linear biasing, when 
substituting equation (^) in equation (|36|), the terms analogous to the ones involving b^^ 
and in equation ( pOj ) for ^ are multiplied by [1 — a^a'^Vb/ Pgg{k)], a function of k. The 
distortion relation for P{k) is thus affected by the biasing scatter in a complicated way. 

However, if the scatter is small, there may he a k range around the peak of P{k) 
where the additive scatter term in equation ( ^7]) is small compared to the rest. In this 
range the relation reduces to an expression similar to equation (|39|) for the corresponding 
power spectra. For example, if ~ 8 /i^^Mpc we have Vt, ~ 2 x 10'^( /i~^Mpc)^, while 
Pmm{k) ~ 10'*(/i-^Mpc)3 at the peak (e.g., Kolatt & Dekel 1997), so a significant k range 
of this sort is viable, especially if a^o' ^ bi. On the other hand, the scatter term always 
dominates equation (^) at small and at large /c's. If cx^^cr ~ 1, then the scatter may dominate 
already not much below k ~ 0.01{h~^Mpc)~^. 

In terms of moments of a general smoothing length r > rb, using equation (^) in 



equation (|33D one obtains a complicated distortion relation again. For small scatter there 
may be a limited range of scales for which the first term in equation (p9D dominates and 
then the distortions reduce to an equation similar to equation ( p9D for moments of smoothed 
fields. At large enough scales, where P^^a is rising with k and thus is decreasing faster 
than oc x~^, the scatter becomes dominant. 

Note that in order to obtain equation (7) of Pen (1998) from the general linear distor- 
tion relation, equation (|36|) , one has to define fc-dependent biasing parameters by Pgg{k) = 
b^axiky P-aira{k) and P^^{k) = b^s_r{k)r{k)P^rn{k). (Note that Pen's /3 refers to his bi, which 
is equivalent to our b^ar, except that he allows it to vary with k). In the case of local biasing, 
a comparison to our equation (p6|) and equation (^) yields 6var(^)^ = bf + 0"bCr^Vb/-Pmm(fc) 
and b^i,Tik)r{k) = bi. In the k range near the peak of Pmm(^) where the constant term in 
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equation (^) may be negligible, one has &var(^) = &i and r(A;) = 1, and there is indeed no 
sign of the stochasticity in the distortion relation. 



5. SKEWNESS AND THREE-POINT CORRELATIONS 

5.1. Skewness 

We now move to measures of biasing involving third-order moments. Given the bi- 
asing random fields e, define the biasing skewness function Sh{6), in analogy to o"b(5) of 
equation (||), by 

S^i5)S^{e'\5), (40) 
where S = (S^)- After averaging over 6, the biasing skewness parameter is 

5b = {e')/S. (41) 

The biasing parameter that is defined by ratio of skewness moments is then, based on 
equation (|^) and after some algebra, 

&3 = ^-^^ + g + S,. (42) 

In the case of deterministic biasing, S'b = Cb = 0, one has 

4-^, (43) 

which differs from the parameters b and 6 of § ^ due to nonlinear effects. In the linear case 
where both b{6) and (Jb(5) are constants, the expression for 63 reduces to 

&3 = &i(l + (44) 

Now, if S'b = 0, then 63 = bi independently of Ub. If, on the other hand, P{g\S) is positively 
skewed, then 63 > 61. 

An interesting quantity involving the skewness and variance of 6 is S3 = S/a'^. In 
the second-order approximation to GI with Gaussian initial fluctuations this quantity is 
constant in time. For top-hat smoothing and a given power spectrum of an effective power 
index n at the smoothing scale, this constant is 5*3 = 34/7 — (3 + n) (Juszkiewicz et al. 
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1993). The corresponding quantity involving the moments of g, S^g = (g^) / {g"^)"^ , provides 
an observational measure of biasing (Weinberg 1995): 



In the case of deterministic biasing 



~ {6%^{6))/S ■ 

In the case of linear biasing where b{6) and ah{S) are constants, this ratio reduces to 



(46) 



'1 + ^2 /1^2^2 



The biasing parameter obtained this way thus depends both on and 5'b. If S'b = 0, as 
when P{g\S) is Gaussian, then 633 = &i (1 + <^b/^i)^; deviation from bi is even larger 

than that of 6inv or 6var, equation (|1^). For positive biasing skewness S'b, the parameter 633 
may in fact become smaller than bi. 

Szapudi (1998) has shown that, under certain simplifying assumptions, the biasing pa- 



rameters (taken as two coefficients in a Taylor expansion, equation (|53D ) can be determined 
using three-point statistics (cumulant correlators). The assumptions made are that the bias- 
ing is local, deterministic and scale independent, and that redshift distortions are neghgible. 
This approach should be generalized to the more realistic case of stochastic biasing and to 
allow other nontrivial features in the biasing scheme. 



5.2. Cosmic Virial Theorem and Energy Equation 

The Cosmic Energy Equation (CE) (Peebles 1980, §74; Peebles 1993, eq. 20.11; Davis, 
Miller & White 1997) can be used to determine Q by relating the observed dispersion of 
galaxy peculiar velocities to a spatial integral over the galaxy-mass cross-correlation func- 
tion, ^gm('")- The observable is the galaxy-galaxy auto-correlation function ^gg(r), so the 
corresponding biasing parameter is 

bcc = {gg)/{g5). (48) 



At zero lag, bee = &inv of equation (p!4D. At non-zero lag, 6ce can be derived from equations (^ 
and (^). For linear biasing, equations (|22| ) and (|23|), one obtains at non-zero lag bee = &i 
which is different from b[^^ if the biasing is stochastic, equation (|15|). 
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The estimation of Q via the Cosmic Virial Theorem (CV), as apphed to galaxy surveys 
(Peebles 1980, §75; Bartlett & Blanchard 1996), relates the observed dispersion of galaxy- 
galaxy peculiar velocities to a spatial integral over the 3-point galaxy-galaxy-mass cross- 
correlation function, ,^ggm (divided by ^gg). The observable is the 3-point galaxy correlation 
function ,^ggg, so the corresponding biasing parameter is 



&CV = {gg9)/{995). (49) 

At zero lag, using equation (^, 



In the case of deterministic but nonlinear biasing, 6cv = {S^b^{S)) / {S^b'^{6)), which in 
general differs from any of the biasing parameters discussed so far. If b{S) and a^iS) are 
constants and the biasing is stochastic, then, at zero lag, = ^l/^i = &i (1 + Si,/bf). 

If the analysis is done on scales smaller than the biasing coherence length rt, then 
the local expressions are relevant. Otherwise, one needs to appeal to three-point spatial 
correlations. 

Note that in the case of CV or CE (which are valid on small scales) the measured 
quantity may be Q/b or Q/b"^, depending on the application, rather than (3 which is typical 
in applications based on the linear approximation to gravitational instability. 



6. CONSTRAINTS ON THE BIASING RELATION 



In the scheme outlined above, the local biasing process at given scale, time and galaxy 
type, is characterized by the conditional probability density function P{g\6). The conditional 
mean, or the function b{6), contains the information about the mean biasing (via the pa- 
rameter b) and the nonlinear features (e.g., via b/b). The first additional quantity of interest 
in the case of non-negligible scatter in the biasing relation can be the conditional standard 
deviation, the function crb(5), and its variance over S, a^. In order to evaluate the actual 
effects of nonlinear and stochastic biasing on the various measurements of /?, one should first 
try to constrain these functions or evaluate these parameters from simulations, theoretical 
approximations, and observations. 
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6.1. Preliminary Results from Simulations 

In an ongoing study, Somerville et al. (1998) are investigating tlie biasing in liigh- 
resolution iV-body simulations of several cosmological scenarios, both for galactic halos and 
for galaxies as identified using semi-analytic models. Earlier results from simulations were 
obtained, e.g., by Cen & Ostriker (1992) and in more detail by Mo & White (1996). We 
refer here to an example of the preliminary results of Somerville et al. , in the context of our 
biasing formalism. As our test case we use a representative cosmological model: Vt = 1 with 
a rCDM power spectrum which roughly obeys the constraints from large-scale structure. 
The simulation mass resolution is 2 x lO^^Mg inside a box of comoving side 85 /i~^Mpc. The 
present epoch is identified with the time when the rms mass fiuctuation in a top-hat sphere 
of radius 8/i~^Mpc is a% = 0.6. 

Figure |l| is borrowed from Somerville et al. (1998) in order to demonstrate the qualitative 
features of the biasing scheme. It shows the density fluctuation fields of galactic halos 
versus mass at the points of a uniform grid at two different times. The halos are selected 
above a mass threshold of 2 x lO^^Mg . The fields are smoothed with a top-hat window of 
radius 8/i~^Mpc. The conditional mean [{g\S) = b{6)6] and the conditional scatter [(e^|(5) = 
al{S)a^] are marked. 

The nonlinear behavior in the negative regime, 5 < 0, is characteristic of all masses, 
times, and smoothing scales: the function {g\S) is fiat near g = 6 = —1 and it abruptly 
steepens towards 6 = 0. In the positive regime, 6 > 0, the behavior is less robust — it 
strongly depends on the mass, time and smoothing scale. The scatter in the figure includes 
both shot noise and physical scatter which are hard to separate properly. The scatter function 
crh{6) grows rapidly from zero at 5 = —1 to a certain value near 6 = 0, and it continues to 
grow slowly for 5 > to an asymptotic value at large 6. 

In the case shown at 2; = 0, the nonlinear parameter is 6^/6^ = 1.08, and the scatter 
parameter is cr^/b"^ = 0.15. The effects of stochasticity and nonlinearity in this specific case 
thus lead to moderate differences in the various measures of /3, on the order of 20 — 30%. 
Gas-dynamics and other non-gravitational processes may extend the range of estimates even 
further. 

A recent hint for the origin of physical scatter in the biasing scheme is provided by 
Blanton et al. (1998). They find, based on hydrodynamic cosmological simulations, that the 
local gas temperature, which is an important factor affecting the efficiency of galaxy forma- 
tion, is not fully correlated with the other dominant factor — the local mass density. They 
therefore argue that this is a physical hidden variable that contributes to the stochasticity 
proposed in our current paper. They also detect significant scale dependence in the biasing 
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Fig. 1. — Biasing of galactic halos versus mass in a cosmological A'^-body simulation, demonstrating 
nonlinearity and stochasticity. The conditional mean (solid curve) and scatter (error bars) are 
marked. The fields smoothed with a top-hat window of radius 8/i~^Mpc are plotted at the points 
of a uniform grid. Left: at the time when as = 0.6 (e.g., z = 0). Right: at an earlier time when 
(78 = 0.3 (e.g., z = 1). Based on Somerville et al. (1998). 

scheme, and identify its main source with the correlation of the temperature with the large 
scale gravitational potential. 



6.2. Approximations for Nonlinear Biasing 

Given the distribution P{S) of the matter fluctuations, the biasing function b{S) should 
obey by definition at least the following two constrains: 

• g > —1 everywhere because the galaxy density pg cannot be negative, and g — —1 at 
S — —1 because there are no galaxies where there is no matter. 

• (g) — because g describes fluctuations about the mean galaxy density. 

An example for a simple functional form that obeys the constraint at 5 — —1 and 
reduces to the linear relation near 5 = is (e.g., Dekel et al. 1993) 

{g\5) ^0(1 + 5)' -1. (51) 
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The constraint (g) = is to be enforced by a specific choice of the factor c as a function 
of the power b. With 6 > 1, this functional form indeed provides a reasonable fit to the 
simulated halo biasing relation in the S < regime. However, the same value of b does not 
necessarily fit the biasing relation in the 6 > regime, which can require either 6 < 1 or 
b > 1 depending on halo mass, smoothing scale and redshift. 

A better approximation could thus be provided by a combination of two functions like 
equation (^Tj) with two different biasing parameters b^^g and 6pos in the regimes 6 < and 
S > respectively. Another useful version of such a combination is 

[ bposd + bo >0 

which provides an even better fit to the behavior in Fig. |l]. As mentioned above, the pa- 
rameter 6neg is always larger than unity while 6pos ranges from slightly below unity to much 
above unity. The best fit to Fig. |l] at z = has b neg ~ 2 and fepos ~ 1 • At high redshift both 
6neg and fepos become significantly larger. 

The nonlinear biasing relation can alternatively be parameterized by a general power 
series: 

9 = t (53) 

n=0 "■■ 

Since g must average to zero, this power series can be written as 

{g\6) = b,5 + \b,{5^ - a') + h,{6' - S) + (54) 

where cx^ = (5^), S = (S^), etc. This determines the constant term b^. The constraint at — 1 
provides another relation between the parameters. Therefore, the expansion to third order 
contains only two free parameters out of four. 

In order to evaluate the parameters b and b for these nonlinear toy models, we approx- 
imate the density PDF as a Gaussian, or alternatively as log-normal in p/p = 1 + 6 (e.g.. 
Coles & Jones 1991; Kofman et al. 1994): 

^^^^ " [27rln(l + a2)]i/2 (1 + 5) ^ ^ 1^ 21n(l + a^) j' ^^^^ 

The only free parameter is a. The skewness, for example, is S* = 2cr^ -|- a^, etc. 

For nonlinear biasing that is described by Taylor expansion to third order (equation (Q)) 
and a Gaussian or log-normal density PDF, assuming 62 ^ bi and o" ^ 1, one obtains 
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This is always larger than unity, but the deviation is small. Alternatively, using the functional 
form of equation (|5lD, with 6neg ranging from 1 to 5, 6pos ranging from 0.5 to 3, and a log- 
normal PDF of a = 0.7, we find numerically that b/b is in the range 1.0 to 1.15. These two 
toy models, which approximate the nonlinear biasing behavior seen for halos in the A^-body 
simulations, indicate that despite the obvious nonlinearity, especially in the negative regime, 
the nonlinear parameter b/b is typically only slightly larger than unity. This means that the 
effects of nonlinear biasing on measurements of (3 are likely to be relatively small. 



6.3. The Special Case of Gaussian Biasing 



A quick comment on "Gaussian" and "bivariate Gaussian" biasing, which has been used 
in the recent literature (e.g.. Pen 1998). 

A specific model for nonlinear and stochastic biasing is where the conditional distribution 
is a Gaussian, but allowing b{S) and cr^(5) to vary with S: 



P{g\S) oc exp 



[g - bm' 



Based on the A^-body simulations, this is a reasonable approximation for galactic halos 



(57) 



However, the model of bivariate Gaussian biasing, which might be tempting because it 
makes some of the computations easier, is much more restrictive; it is in fact a special case 
of linear biasing. This model assumes that the joint distribution of galaxies and mass is a 
two-dimensional Gaussian, 



P{g, 6) oc exp 



g'' - 2rg6 + r 
2(1 -r2) 



(58) 



where g ~ g/og and 5 = and with the local biasing correlation coefficient r =const. 
If -P(5) is also a Gaussian, P{5) oc exp[— 5^/2], the conditional probability is 



Pi^g \~5) = OC exp 



{g - rSf 



(59) 



This is a one-dimensional Gaussian for ^, with mean r5 and variance (1 — r^). Back to 
the quantities g and 5, the conditional mean is {g\S) = rb^a^^S, where 6var = CTg/a as usual. 
This is thus a special case of linear biasing, with a constant linear biasing parameter bi = 
rb^s^r independent of 5, as in equation (p!5|). From the conditional variance of the Gaussian 
distribution in equation (|59D, the biasing scatter is also a constant independent of (5, = 
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6^(1 — r^). Based on A'"-body simulations, linear biasing could be a poor approximation 
even for galactic halos. The whole biasing scheme is characterized in this case by only two 
parameters, fcyax and r, or alternatively bi and ay,, independent of S. 

6.4. Observations 

Direct constraints on the local biasing field should, in principle, be provided by the 
data themselves, of galaxy density (e.g., from redshift surveys) versus mass density (e.g., 
from peculiar velocity surveys, or gravitational lensing). It is a bit early to deduce the 
nonlinear shape of b{5) from these data because of the large errors that they involve at 
present. However, we note a qualitative example of scatter in the biasing relation in the 
fact that the smoothed density peaks of the Great Attractor (GA) and Perseus Pisces (PP) 
are of comparable height in the mass distribution as recovered by POTENT from observed 
velocities (e.g., Dekel 1994; daCosta et al. 1996; Dekel et al. 1998), but PP is significantly 
higher than GA in the galaxy map (e.g., Hudson et al. 1995; Sigad et al. 1998). For example, 
a linear regression of the 12 /i~^Mpc-smoothed density fields of POTENT mass and optical 
galaxies in our cosmological neighborhood yields a ~ 2 per degree of freedom for the 
assumed errors in the data (Hudson et al. 1995). One way to obtain a ~ 1) desired, 
is to assume a biasing scatter of o"b ~ 0.5 in the optical density (while a ~ 0.3 at that 
smoothing). With bi ~ 1, one has cr^/bf ~ 0.25. This is only a very crude estimate, and 
there is yet much to be done along similar lines with future data. 

A promising method has been worked out (Sigad & dekel 1998; see also Dckcl 1998b) 
for recovering the mean biasing function b{6) and its associated parameters b and b from a 
measured PDF (or counts in cells) of galaxies in a redshift survey. If g{S) were deterministic 
and monotonic, then it could be derived from the cumulative PDFs of galaxies and mass, 
Cg{g) and C{6), via g{6) = C~^[C{5)] (also Narayanan & Weinberg 1998). It is found for 
halos in A^-body simulations that this is a good approximation for {g\6) despite the scatter. 
The other key point confirmed by a suite of simulations is that C{5) is relatively insensitive to 
the cosmological model or the fiuctuation power spectrum, and can be approximated for our 
purpose by a log-normal distribution in 1 + 5 (e.g., Bernardeau 1994; Bernardeau & Kofman 
1995). Thus, b{6) can be evaluated from a measured Cg{g) and the rms a of mass fluctuations 
on the same smoothing scale. Since redshift surveys arc by far richer than peculiar-velocity 
samples, this method will allow a better handle on b{6) than the local comparison of density 
fields of galaxies and mass. It can be applied to local redshift surveys as well as surveys of 
objects at high redshift. 

Constraints on the biasing scheme can also be obtained by comparing the clustering 
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properties of galaxies of different types in a given redshift surveys (e.g., Lahav & Saslaw 
1992). Indeed, partly motivated by the ideas of our current paper, a clear confirmation for 
nontrivial biasing, non-linear and/or stochastic beyond shot noise, has recently been reported 
by Tegmark & Bromley (1998) based on the Las Campanas Redshift Survey. 

7. CONCLUSIONS 

We have introduced a straightforward formalism for describing the biasing relation be- 
tween the density fluctuation fields of galaxies and mass, based on the conditional probability 
function P{g\6). The key feature of this formalism is the natural separation between non- 
linear and stochastic effects in the biasing scheme. The nonlinearity is expressed by the 
conditional mean via the function b{6), and the statistical scatter is measured by the condi- 
tional standard deviation, crb(5), and higher moments if necessary. For analyses using local 
moments of second order, the biasing scheme is characterized by three parameters: b mea- 
suring the mean biasing, b/b measuring the effect of nonlinearity, and a^/b measuring the 
effect of stochasticity. 

Deviations from linear and deterministic biasing typically result in biased estimates of 
the biasing parameter, or the parameter P (~ f2°-^/6), which depend on the actual method of 
measurement. The nonlinearity and the scatter lead to differences of order 6^/6^ and o-^/l)^ 
respectively in the different estimators of (3 using second-order local moments. They typically 
lead to an underestimate of /3 with respect to $ — f{fl)/b. Based on A^-body simulations 
and toy models, the effects of nonlinear biasing are typically small, on the order of 20% or 
less, and the effects of scatter could be somewhat larger. One expects the (3 parameters from 
second-order local moments to be biased in the following order: /?inv < /Svar < /3- 

The stochasticity affects the redshift-distortion analysis only by limiting the useful range 
of scales, especially in the analysis involving power spectra. In this range, for linear stochas- 
tic biasing, the basic linear expression reduces to the simple Kaiser formula for b{6) = b = bi 
(not 6var), and it does not involve the scatter at all. The distortion analysis is in princi- 
ple sensitive to the nonlinear properties of biasing, but the nonlinear effects, especially at 
low redshifts, are expected to be weak, and on the same order as the effects of nonlinear 
gravitational instability. This is good news for the prospects of measuring an unbiased (3 
from redshift distortions in the large redshift surveys of the near future (2dF and SDSS). A 
detailed nonlinear analysis of redshift distortions with nonlinear biasing will be reported in 
a subsequent paper. 

More detailed studies of simulations, including different recipes for galaxy formation, are 
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required in order to constrain the parameters of the biasing formahsm more accurately. The 
analysis could also be extended to include non-local biasing, using the biasing correlations 
as defined here. 

The study of stochastic and nonlinear biasing should be extended to address the time 
evolution of biasing because many relevant measurements of galaxy clustering are now being 
done at high redshifts. As seen in Fig. the biasing is clearly a function of cosmological 
epoch (e.g., Rees, private communication; Dekel & Rees 1987; Mo & White 1996; Steidel 
et al. 1996; 1998; Bagla 1998a, 1998b; Matarrese et al. 1997; Wechsler et al. 1998; Peacock 
1998). In particular, if galaxy formation is limited to a given epoch and the biasing is linear, 
one can show (e.g.. Fry 1996) that the linear biasing factor hi would eventually approach unity 
as a simple result of the continuity equation. Tegmark & Peebles (1998) have generalized 
the analytic study of time evolution to the case of stochastic but still linear biasing and 
showed how 6var and r approach unity in this case. Analytic attempts to study the evolution 
of mildly nonlinear stochastic biasing have been reported recently (Taruya, Koyama & Soda 
1998; Taruya & Soda 1998; Catelan et al. 1998; Catelan, Matarrese & Porciani 1998; Sheth 
& Lemson 1998). These studies can be extended to the general nonlinear case using our 
formalism. The simulations of Somerville et al. (1998) are aimed at this goal. 

More accurate measurements of peculiar velocities in our greater cosmological neighbor- 
hood, and careful comparisons to the galaxy distribution, promise to allow improved obser- 
vational estimates of the biasing scatter in the future. The reconstruction of the large-scale 
mass distribution based on weak gravitational lensing (Van Waerbeke 1998; 1999; Schneider 
1998; Kaiser et al. 1998) is also becoming promising for this purpose. 

The main moral of this paper is that in order to put any measurement of {3 in cosmo- 
logical perspective, and in particular when trying to use it for an accurate measurement of 
the cosmological parameter f2, one should consider the effects of nonlinear and stochastic 
biasing and the associated complications of scale dependence, time dependence, and type 
dependence. The current different estimates are expected to span a range of ~ 30% in (3 due 
to stochastic and nonlinear biasing. The analysis of redshift distortions seems to be most 
promising; once it is limited to the appropriate range of scales, the analysis is independent 
of stochasticity and the nonlinear effects are expected to be relatively small. 
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S.D.M. White for stimulating discussions. This research was supported in part by the US- 
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